PATENT APPLICATION: US/09/717,095 TIME: 20:11:15

Input Set : N:\Crf3\RULE60\09717095.txt 
Output Set: N:\CRF4\02112003\l717095.raw 

SEQUENCE LISTING 

3 (1) GENERAL INFORMATION: 

5 (i) APPLICANT: Vinik, Aaron I. 

6 Pittenger, Gary L. 

7 Rafaeloff, Ronit 

8 Rosenberg, Lawrence 

9 Duguid, William P. 

12 (ii) TITLE OF INVENTION: INGAP PROTEIN INVOLVED IN PANCREATIC 

13 ISLET NEOGENESIS 
15 (iii) NUMBER OF SEQUENCES: 7 

17 (iv) CORRESPONDENCE ADDRESS: 

18 (A) ADDRESSEE: Banner & Witcoff, Ltd. 

19 (B) STREET: 1001 G Street, N.W. 

20 (C) CITY: Washington 

21 (D) STATE: D.C. 

22 (E) COUNTRY: US 

23 (F) ZIP: 20001-4597

25 (v) COMPUTER READABLE FORM: 

26 (A) MEDIUM TYPE: Floppy disk 

27 (B) COMPUTER: IBM PC compatible

28 (C) OPERATING SYSTEM: PC-DOS/MS-DOS

29 (D) SOFTWARE: PatentIn Release #1.0, Version #1.25
31 (vi) CURRENT APPLICATION DATA:

C--> 32 (A) APPLICATION NUMBER: US/09/717,095

C--> 33 (B) FILING DATE: 22-Nov-2000

34 (C) CLASSIFICATION:

36 (vii) PRIOR APPLICATION DATA:

37 (A) APPLICATION NUMBER: US/08/709,662

38 (B) FILING DATE: 09-SEP-1996
40 (viii) ATTORNEY/AGENT INFORMATION:

41 (A) NAME: Kagan, Sarah A.

42 (B) REGISTRATION NUMBER: 32,141

43 (C) REFERENCE/DOCKET NUMBER: 00570.59178
45 (ix) TELECOMMUNICATION INFORMATION:
46 (A) TELEPHONE: 202-508-9100

47 (B) TELEFAX: 202-508-9299

50 (2) INFORMATION FOR SEQ ID NO: 1:

52 (i) SEQUENCE CHARACTERISTICS:

53 (A) LENGTH: 747 base pairs

54 (B) TYPE: nucleic acid

55 (C) STRANDEDNESS: single

56 (D) TOPOLOGY: linear
58 (ii) MOLECULE TYPE: cDNA
60 (iii) HYPOTHETICAL: NO
62 (iv) ANTI-SENSE: NO
64 (vi) ORIGINAL SOURCE:
65 (A) ORGANISM: Cricetulus
67 (ix) FEATURE:
68 (A) NAME/KEY: CDS
69 (B) LOCATION: 20..541
72 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:
74 CTGCAAGACA GGTACCATG ATG CTT CCC ATG ACC CTC TGT AGG ATG TCT TGG 52
75 Met Leu Pro Met Thr Leu Cys Arg Met Ser Trp
76 1 5 10
78 ATG CTG CTT TCC TGC CTG ATG TTC CTT TCT TGG GTG GAA GGT GAA GAA 100
79 Met Leu Leu Ser Cys Leu Met Phe Leu Ser Trp Val Glu Gly Glu Glu
80 15 20 25
82 TCT CAA AAG AAA CTG CCT TCT TCA CGT ATA ACC TGT CCT CAA GGC TCT 148
83 Ser Gln Lys Lys Leu Pro Ser Ser Arg Ile Thr Cys Pro Gln Gly Ser
84 30 35 40
86 GTA GCC TAT GGG TCC TAT TGC TAT TCA CTG ATT TTG ATA CCA CAG ACC 196
87 Val Ala Tyr Gly Ser Tyr Cys Tyr Ser Leu Ile Leu Ile Pro Gln Thr
88 45 50 55
90 TGG TCT AAT GCA GAA CTA TCC TGC CAG ATG CAT TTC TCA GGA CAC CTG 244
91 Trp Ser Asn Ala Glu Leu Ser Cys Gln Met His Phe Ser Gly His Leu
92 60 65 70 75
94 GCA TTT CTT CTC AGT ACT GGT GAA ATT ACC TTC GTG TCC TCC CTT GTG 292
95 Ala Phe Leu Leu Ser Thr Gly Glu Ile Thr Phe Val Ser Ser Leu Val
96 80 85 90
98 AAG AAC AGT TTG ACG GCC TAC CAG TAC ATC TGG ATT GGA CTC CAT GAT 340
99 Lys Asn Ser Leu Thr Ala Tyr Gln Tyr Ile Trp Ile Gly Leu His Asp
100 95 100 105
102 CCC TCA CAT GGT ACA CTA CCC AAC GGA AGT GGA TGG AAG TGG AGC AGT 388
103 Pro Ser His Gly Thr Leu Pro Asn Gly Ser Gly Trp Lys Trp Ser Ser
104 110 115 120
106 TCC AAT GTG CTG ACC TTC TAT AAC TGG GAG AGG AAC CCC TCT ATT GCT 436
107 Ser Asn Val Leu Thr Phe Tyr Asn Trp Glu Arg Asn Pro Ser Ile Ala
108 125 130 135
110 GCT GAC CGT GGT TAT TGT GCA GTT TTG TCT CAG AAA TCA GGT TTT CAG 484
111 Ala Asp Arg Gly Tyr Cys Ala Val Leu Ser Gln Lys Ser Gly Phe Gln
112 140 145 150 155
114 AAG TGG AGA GAT TTT AAT TGT GAA AAT GAG CTT CCC TAT ATC TGC AAA 532
115 Lys Trp Arg Asp Phe Asn Cys Glu Asn Glu Leu Pro Tyr Ile Cys Lys
116 160 165 170
118 TTC AAG GTC TAGGGCAGTT CTAATTTCAA CAGCTTGAAA ATATTATGAA 581
119 Phe Lys Val
122 GCTCACATGG ACAAGGAAGC AAGTATGAGG ATTCACTCAG GAAGAGCAAG CTCTGCCTAC 641
124 ACACCCACAC GAATTCCCTT ATATCATCTC TGCTGTTTTT CTATCAGTAT ATTCTGTGGT 701
126 GGCTGTAACC TAAAGGCTCA GAGAACAAAA ATAAAATGTC ATCAAC 747
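
The CDS feature of SEQ ID NO: 1 (LOCATION: 20..541) can be checked against the 174-residue translation with a few lines of arithmetic. The Python sketch below is only an illustration and is not part of the filed listing: the variable and function names are hypothetical, the codon table is a partial excerpt of the standard genetic code covering only the codons in the first sequence row, and the row itself is copied from the listing with its spaces removed.

```python
# Values copied from the SEQ ID NO: 1 entry; all names here are illustrative.
CDS_START, CDS_END = 20, 541   # (B) LOCATION: 20..541, 1-based and inclusive
TOTAL_LENGTH = 747             # (A) LENGTH: 747 base pairs

cds_length = CDS_END - CDS_START + 1
codon_count, remainder = divmod(cds_length, 3)

# First sequence row of the listing (it ends at position 52), spaces removed.
first_row = "CTGCAAGACAGGTACCATG" + "ATGCTTCCCATGACCCTCTGTAGGATGTCTTGG"

# Partial standard genetic code: only the codons that occur in that row.
CODON_TABLE = {
    "ATG": "Met", "CTT": "Leu", "CCC": "Pro", "ACC": "Thr", "CTC": "Leu",
    "TGT": "Cys", "AGG": "Arg", "TCT": "Ser", "TGG": "Trp",
}


def translate(seq, start):
    """Translate seq from a 1-based start position, reading whole codons only."""
    offset = start - 1
    usable = len(seq) - (len(seq) - offset) % 3
    return [CODON_TABLE[seq[i:i + 3]] for i in range(offset, usable, 3)]


first_residues = translate(first_row, CDS_START)

print(codon_count, remainder)
print(" ".join(first_residues))
```

The 522 nucleotides of the stated range divide evenly into 174 codons, which agrees with the length given for SEQ ID NO: 2. The TAG stop codon at positions 542..544 lies just outside the stated CDS range, so it is not counted.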

129 (2) INFORMATION FOR SEQ ID NO: 2: 

131 (i) SEQUENCE CHARACTERISTICS: 

132 (A) LENGTH: 174 amino acids 
133 (B) TYPE: amino acid

134 (D) TOPOLOGY: linear

136 (ii) MOLECULE TYPE: protein

138 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

140 Met Leu Pro Met Thr Leu Cys Arg Met Ser Trp Met Leu Leu Ser Cys
141 1 5 10 15
143 Leu Met Phe Leu Ser Trp Val Glu Gly Glu Glu Ser Gln Lys Lys Leu
144 20 25 30
146 Pro Ser Ser Arg Ile Thr Cys Pro Gln Gly Ser Val Ala Tyr Gly Ser
147 35 40 45
149 Tyr Cys Tyr Ser Leu Ile Leu Ile Pro Gln Thr Trp Ser Asn Ala Glu
150 50 55 60
152 Leu Ser Cys Gln Met His Phe Ser Gly His Leu Ala Phe Leu Leu Ser
153 65 70 75 80
155 Thr Gly Glu Ile Thr Phe Val Ser Ser Leu Val Lys Asn Ser Leu Thr
156 85 90 95
158 Ala Tyr Gln Tyr Ile Trp Ile Gly Leu His Asp Pro Ser His Gly Thr
159 100 105 110
161 Leu Pro Asn Gly Ser Gly Trp Lys Trp Ser Ser Ser Asn Val Leu Thr
162 115 120 125
164 Phe Tyr Asn Trp Glu Arg Asn Pro Ser Ile Ala Ala Asp Arg Gly Tyr
165 130 135 140
167 Cys Ala Val Leu Ser Gln Lys Ser Gly Phe Gln Lys Trp Arg Asp Phe
168 145 150 155 160
170 Asn Cys Glu Asn Glu Leu Pro Tyr Ile Cys Lys Phe Lys Val
171 165 170

173 (2) INFORMATION FOR SEQ ID NO: 3: 

175 (i) SEQUENCE CHARACTERISTICS:

176 (A) LENGTH: 175 amino acids

177 (B) TYPE: amino acid

178 (D) TOPOLOGY: linear
180 (ii) MOLECULE TYPE: protein
182 (iv) ANTI-SENSE: NO

184 (vi) ORIGINAL SOURCE:

185 (A) ORGANISM: Rattus rattus

188 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

190 Met Leu His Arg Leu Ala Phe Pro Val Met Ser Trp Met Leu Leu Ser
191 1 5 10 15
193 Cys Leu Met Leu Leu Ser Gln Val Gln Gly Glu Asp Ser Pro Lys Lys
194 20 25 30
196 Ile Pro Ser Ala Arg Ile Ser Cys Pro Lys Gly Ser Gln Ala Tyr Gly
197 35 40 45
199 Ser Tyr Cys Tyr Ala Leu Phe Gln Ile Pro Gln Thr Trp Phe Asp Ala
200 50 55 60
202 Glu Leu Ala Cys Gln Lys Arg Pro Glu Gly His Leu Val Ser Val Leu
203 65 70 75 80
205 Asn Val Ala Glu Ala Ser Phe Leu Ala Ser Met Val Lys Asn Thr Gly
206 85 90 95
208 Asn Ser Tyr Gln Tyr Ile Trp Ile Gly Leu His Asp Pro Thr Leu Gly
209 100 105 110
211 Gly Glu Pro Asn Gly Gly Gly Trp Glu Trp Ser Asn Asn Asp Ile Met
212 115 120 125
214 Asn Tyr Val Asn Trp Glu Arg Asn Pro Ser Thr Ala Leu Asp Arg Gly
215 130 135 140
217 Phe Cys Gly Ser Leu Ser Arg Ser Ser Gly Phe Leu Arg Trp Arg Asp
218 145 150 155 160
220 Thr Thr Cys Glu Val Lys Leu Pro Tyr Val Cys Lys Phe Thr Gly
221 165 170 175
223 (2) INFORMATION FOR SEQ ID NO: 4: 

225 (i) SEQUENCE CHARACTERISTICS: 

226 (A) LENGTH: 175 amino acids 

227 (B) TYPE: amino acid 

228 (D) TOPOLOGY: linear 
230 (ii) MOLECULE TYPE: protein 

232 (vi) ORIGINAL SOURCE: 

233 (A) ORGANISM: Homo sapiens 

236 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

238 Met Leu Pro Pro Met Ala Leu Pro Ser Val Ser Trp Met Leu Leu Ser

239 1 5 10 15

241 Cys Leu Met Leu Leu Ser Gln Val Gln Gly Glu Glu Pro Gln Arg Glu

242 20 25 30

244 Leu Pro Ser Ala Arg Ile Arg Cys Pro Lys Gly Ser Lys Ala Tyr Gly

245 35 40 45

247 Ser His Cys Tyr Ala Leu Phe Leu Ser Pro Lys Ser Trp Thr Asp Ala

248 50 55 60

250 Asp Leu Ala Cys Gln Lys Arg Pro Ser Gly Asn Leu Val Ser Val Leu

251 65 70 75 80

253 Ser Gly Ala Glu Gly Ser Phe Val Ser Ser Leu Val Lys Ser Ile Gly

254 85 90 95

256 Asn Ser Tyr Ser Tyr Val Trp Ile Gly Leu His Asp Pro Thr Gln Gly

257 100 105 110

259 Thr Glu Pro Asn Gly Glu Gly Trp Glu Trp Ser Ser Ser Asp Val Met

260 115 120 125

262 Asn Tyr Phe Ala Trp Glu Arg Asn Pro Ser Thr Ile Ser Ser Pro Gly

263 130 135 140

265 His Cys Ala Ser Leu Ser Arg Ser Thr Ala Phe Leu Arg Trp Lys Asp

266 145 150 155 160

268 Tyr Asn Cys Asn Val Arg Leu Pro Tyr Val Cys Lys Phe Thr Asp

269 165 170 175
271 (2) INFORMATION FOR SEQ ID NO: 5: 

273 (i) SEQUENCE CHARACTERISTICS: 

274 (A) LENGTH: 174 amino acids 

275 (B) TYPE: amino acid 

276 (D) TOPOLOGY: linear 

278 (ii) MOLECULE TYPE: protein

280 (vi) ORIGINAL SOURCE: 

281 (A) ORGANISM: Rattus rattus 

284 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
286 Met Leu Pro Arg Val Ala Leu Thr Thr Met Ser Trp Met Leu Leu Ser
287 1 5 10 15
289 Ser Leu Met Leu [illegible] Ser Gln Val [illegible] Gly [illegible] Asp Ala Lys Glu [illegible]
290 20 25 30
292 Val [illegible] Thr [illegible] Arg [illegible] Ser [illegible] [illegible] Lys Gly Ser Arg [illegible] Tyr Gly
293 35 40 45
295 [illegible] Tyr [illegible] Tyr Ala Leu [illegible] Ser Val Ser Lys Ser Trp [illegible] Asp Ala
296 50 55 60
298 [illegible] [illegible] [illegible] [illegible] [illegible] Lys Arg [illegible] Ser Gly His Leu Val Ser Val Leu
299 65 70 75 80
301 Ser Gly Ser Glu Ala [illegible] Phe Val [illegible] Ser Leu Ile Lys Ser Ser Gly
302 85 90 95
304 Asn Ser Gly Gln Asn Val Trp Ile Gly Leu His [illegible] Pro Thr Leu Gly
305 100 105 110
307 Gln Glu Pro Asn Arg Gly Gly Trp Glu Trp Ser Asn Ala Asp Val Met
308 115 120 125
310 Asn Tyr Phe Asn Trp Glu Thr Asn Pro Ser Ser Val Ser [illegible] Ser His
311 130 135 140
313 Cys Gly Thr Leu Thr Arg Ala Ser Gly Phe Leu Arg Trp Arg Glu Asn
314 145 150 155 160
316 Asn Cys Ile Ser Glu Leu Pro Tyr Val Cys Lys Phe Lys Ala
317 165 170

319 (2) INFORMATION FOR SEQ ID NO: 6:

321 (i) SEQUENCE CHARACTERISTICS:

322 (A) LENGTH: 174 amino acids

323 (B) TYPE: amino acid

324 (D) TOPOLOGY: linear

326 (ii) MOLECULE TYPE: protein

328 (vi) ORIGINAL SOURCE:

329 (A) ORGANISM: Rattus rattus

332 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

334 Met Leu Pro Arg Leu Ser Phe Asn Asn Val [illegible] Trp Thr Leu Leu Tyr
335 1 5 10 15
337 Tyr Leu Phe Ile Phe Gln Val Arg Gly [illegible] Asp Ser Gln Lys Ala Val
338 20 25 30
340 Pro Ser Thr Arg Thr Ser Cys Pro Met Gly Ser Lys [illegible] Tyr Arg Ser
341 35 40 45
343 Tyr Cys Tyr Thr Leu Val Thr Thr Leu Lys Ser Trp Phe Gln Ala Asp
344 50 55 60
346 Leu Ala Cys Gln Lys Arg Pro Ser Gly His Leu Val Ser Ile Leu Ser
347 65 70 75 80
349 Gly Gly Glu Ala Ser Phe Val Ser Ser Leu Val Thr Gly Arg Val Asn
350 85 90 95
352 Asn Asn Gln Asp Ile Trp Ile Trp Leu His Asp Pro Thr Met Gly Gln
353 100 105 110
355 Gln Pro Asn Gly Gly Gly Trp Glu Trp Ser Asn Ser Asp Val Leu Asn
356 115 120 125
358 Tyr Leu Asn Trp Asp Gly Asp Pro Ser Ser Thr Val Asn Arg Gly Asn
359 130 135 140